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ABSTRACT 

This paper presents a distributed algorithm on wireless ad- 
hoc networks that runs in polylogarithmic number of rounds 
in the size of the network and constructs a linear size, lightweight, 
(1 + e)-spanner for any given e > 0. A wireless network 
is modeled by a d- dimensional a-quasi unit ball graph (a- 
UBG), which is a higher dimensional generalization of the 
standard unit disk graph (UDG) model. The d-dimensional 
a-UBG model goes beyond the unrealistic "flat world" as- 
sumption of UDGs and also takes into account transmission 
errors, fading signal strength, and physical obstructions. 
The main result in the paper is this: for any fixed e > 0, 
< a < 1, and d > 2 there is a distributed algorithm run- 
ning in 0(log ndog* n) communication rounds on an n-node, 
d-dimensional a-UBG G that computes a (1 + e)-spanner G' 
of G with maximum degree A(G') = 0(1) and total weight 
w(G') = 0(w(MST(G)). This result is motivated by the 
topology control problem in wireless ad-hoc networks and 
improves on existing topology control algorithms along sev- 
eral dimensions. The technical contributions of the paper 
include a new, sequential, greedy algorithm with relaxed 
edge ordering and lazy updating, and clustering techniques 
for filtering out unnecessary edges. 

Categories and Subject Descriptors: C.2.4 [Computer- 
Communication Networks]: Distributed Systems 
General Terms: Algorithms, Performance, Theory. 
Keywords: Spanners, Topology control, Wireless ad-hoc 
networks, Unit ball graphs. 

1. INTRODUCTION 

Let G = ( V, E) be a graph with edge weights w : E — > 1Z + . 
For t > 1, a t-spanner of G is a spanning subgraph G' of 
G such that for all pairs of vertices u, v £ V, the length of 
a shortest uu-path in G' is at most t times the length of a 
shortest uu-path in G. The problem of constructing a sparse 
i-spanner, for small t, of a given graph G has been exten- 
sively studied by researchers in distributed computing and 
computational geometry and more recently by researchers in 
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ad-hoc wireless networks. In this paper we present a fast dis- 
tributed algorithm for constructing a linear size, lightweight 
i-spanner of bounded degree for any given t > 1, on wireless 
networks. Below, we describe our result more precisely. 

1.1 Network model 

We model wireless networks using d-dimensional quasi 
unit ball graphs. For any fixed a, < a < 1 and inte- 
ger d > 2, a d-dimensional a-quasi unit ball graph (a-UBG, 
in short) is a graph G = (V, E) whose vertex set V can be 
placed in one-one correspondence with a set of points in the 
d-dimensional Euclidean space and whose edge set E sat- 
isfies the constraint: if |itw| < a then {u,v} £ E and if 
\uv\ > 1 then {u, v} E. Here we use |w| to denote the 
Euclidean distance between the points corresponding to ver- 
tices u and v. The a-UBG model does not prescribe whether 
a pair of vertices whose distance is in the range (a, 1] are 
to be connected by an edge or not. This is an attempt to 
take into account transmission errors, fading signal strength, 
and physical obstructions. Our algorithm does not need to 
know the locations of nodes of the a-UBG in d-dimensional 
Euclidean space; just the pairwise Euclidean distances. 

The a-UBG model is a higher dimensional generalization 
of the somewhat simplistic unit disk graph (UDG) model 
of wireless networks that is popular in literature. Specifi- 
cally, when a — 1 and d — 2, a d-dimensional a-UBG is 
just a UDG. UDGs are attractive due to their mathemati- 
cal simplicity, but have been deservedly criticized for being 
unrealistic models of wireless networks [TO]. In our view, d- 
dimensional a-UBGs are a significant step towards a more 
realistic model of wireless networks. Two-dimensional a- 
UBGs were proposed in [T] as a model of wireless ad-hoc net- 
works with unstable transmission ranges and the difficulty 
of doing geometric routing in such networks was shown. 

Our communication model is the standard synchronous 
message passing model that does not account for channel ac- 
cess and collision issues. In this communication model, time 
is divided into rounds. In each round, each node can send a 
different message to each of its neighbors, receive different 
messages from all neighbors and perform arbitrary (polyno- 
mial) local computation. The length of messages exchanged 
between nodes is logarithmic in the number of nodes. We 
measure the cost of our algorithm in terms of the number of 
communication rounds. Although this model is not widely 
considered to be realistic, it is nevertheless interesting be- 
cause it demonstrates the locality of computations. 



1.2 Our result 

For any edge weighted graph J, we use w(J) to denote 
the sum of the weights of all the edges in J and MST(J) 
to denote a minimum weight spanning tree of J. For any 
fixed s > 0, < a < 1, and d > 2 our algorithm runs 
in 0(logn • log* n) communication rounds on an n-node, d- 
dimensional a-UBG and computes a (1 + e)-spanner G' of 
G whose maximum degree A(G') = O(l) and whose total 
weight w(G') = 0(w(MST{G)). Since any spanner of G 
has weight bounded below by w(M ST(G)), the weight of 
the output produced by the algorithm is within a constant 
times the optimal weight. As far as we know, our result 
significantly improves all known results of a similar kind 
along several dimensions. More on this further below. 

1.3 Topology control 

Our result is motivated by the topology control problem 
in wireless ad-hoc networks. For an overview of topology 
control, see the survey by Rajaraman [17]. Since an ad-hoc 
network does not come with fixed infrastructure, there is no 
topology to start with and informally speaking, the topol- 
ogy control problem is one of selecting neighbors for each 
node so that the resulting topology has a number of useful 
properties. More precisely, let V be a set of nodes that can 
communicate via wireless radios and for each v £ V, let N(v) 
denote the set of all nodes that v can reach when transmit- 
ting at maximum power. The induced digraph G — (V,E), 
where E — {{u, v} \ v G N(u)}, represents the network in 
which every node has chosen to transmit at maximum power 
and has designated every node it can reach as its neighbor. 
The topology control problem is the problem of devising an 
efficient and local protocol P for selecting a set of neighbors 
Np(v) C N(v) for each node v G V. The induced digraph 
Gp = (V,E P ), where E P = {{u,v} | v G N P (u)} is typ- 
ically required the satisfy properties such as symmetry (if 
v G N P (u) then u G N P (v)), sparseness (\E P \ = 0(\V\)) 
or bounded degree (|A r p(v)| < c for all nodes v and some 
constant c), and the spanner property. Sometimes stronger 
versions of connectivity such as fc-vertex connectivity or k- 
edge connectivity (for k > 1) are desired, both for providing 
fault-tolerance and for improving throughput [BJ|7J. If the 
input graph consists of nodes in the plane, it is quite com- 
mon to require that the output graph be planar [131 1141 
1151 1181 19 . This requirement is motivated by the existence 
of simple, memory-less, geometric routing algorithms that 
guarantee message delivery only when the underlying graph 
is planar [9]. 

Though the topology control problem is recent, there is 
already an extensive body of literature on the problem to 
which the above sample of citations do not do justice. How- 
ever, many of the topology control protocols that provide 
worst case guarantees on the quality of the topology, assume 
that the network is modeled by a UDG. A recent example 
[15] presents a distributed algorithm that requires a linear 
number of communication rounds in the worst case to com- 
pute a planar t-spanner of a given UDG with t ~ 6.2 and in 
which each node has degree at most 25. These two constants 
can be slightly tuned - t can be brought down to about 3.8 
with a significant increase in the degree bound. We improve 
on the result in [151 along several dimensions. As is gener- 
ally known among practitioners in ad-hoc wireless networks, 
the "flat world" assumption and the identical transmission 
range assumption of UDGs are unrealistic [10]. By using 



an a-UBG we significantly generalize our model of wireless 
networks, hopefully moving much closer to reality. For any 
£ > 0, our algorithm returns a (1 + e)-spanner; as far as we 
know, this is the first distributed algorithm that produces an 
arbitrarily good spanner for an a-UBG model of wireless net- 
works. We also guarantee that the total weight of the output 
is within constant times optimal - a guarantee that is not 
provided in [15]. Finally, using algorithmic techniques and 
distributed data structures that might be of independent in- 
terest, we ensure that our protocol runs in 0(logn • log* n) 
communication rounds. We are not aware of any topology 
control algorithm that runs in poly-logarithmic number of 
rounds and provides anywhere close to the guarantees pro- 
vided by our algorithm. 

1.4 Spanners in computational geometry 

Starting in the early 1990's, researchers in computational 
geometry have attempted to find sparse, lightweight span- 
ners for complete Euclidean graphs. Given a set P of n 
points in 1Z d , the tuple (P, E), where E is the set of line seg- 
ments {{p,p'} | p,p' G P}, is called the complete Euclidean 
graph on P. For any subset E' C E, (P, E') is called a Eu- 
clidean graph on P. The specific problem that researchers 
in computational geometry have considered, is this. Given 
a set P of n points in 1Z d and t > 1, compute a Euclidean 
graph on P that is a t-spanner of the complete Euclidean 
graph on P, whose maximum degree is bounded by O(l) and 
whose weight is bounded by the weight of a minimum span- 
ning tree on P. For an early example, see [12] in which the 
authors show that there are "planar graphs almost as good 
as the complete graphs and almost as cheap as minimum 
spanning trees." This was followed by a series of improve- 
ments [21 [3] SJ E] , with the most recent paper [2] presenting 
algorithms for constructing Euclidean subgraphs that pro- 
vide the additional property of fc-fault tolerance. Most of 
the papers mentioned above start with the following simple, 
greedy algorithm. 



Algorithm SEQ-GREEDY (G = {V,E),t) 

1. Order the edges in E in non- decreasing order of length. 

2. E' ^<t>,G'^- (V,E') 

3. For each edge e = {u, v} G E if there is no TO-path in G' 
of length at most t ■ \uv\ 

(a) E' «- E 1 U {e} 

(b) G'^(V,E>) 
Output G' . 

It is well-known [4] that if the input graph G = (V, E) 
is the complete Euclidean graph, then the output graph 
G' = (V, E') produced by SEQ-GREEDY has the following use- 
ful properties: (i) G' is a f-spanner of G, (ii) A(G') = O(l), 
and (iii) w(G') = 0(w{MST(G))). A naive implementation 
of SEQ-GREEDY takes 0(n 3 log n) time because a quadratic 
number of shortest path queries need to be answered on a 
dynamic graph with O(n) edges. Consequentially, papers 
in this area [3] [5] focus on trying to implement SEQ-GREEDY 
efficiently. For example, Das and Narasimhan [4] show how 
to use certain kind of graph clustering to answer shortest 
path queries efficiently, thereby reducing the running time 
of SEQ-GREEDY to 0(n log 2 n). One of the contributions of 
this paper is to show how a variant of the Das-Narasimhan 



clustering scheme can be implemented and maintained effi- 
ciently, in a distributed setting. 

1.5 Summary of our contributions 

In obtaining the main result, our paper makes the follow- 
ing contributions. 

1. We first show that sparse, lightweight t-spanners for 
arbitrarily small t > 1, not only exist for d-dimensional 
a-UBGs, but can be computed using SEQ-GREEDY. Note 
that sparse t-spanners for arbitrarily small values of 
t > 1 do not exist for general graphs. For example, 
there is a classical graph-theoretic result that shows 
that for any t > 1, there exist (infinitely many) un- 
weighted n-vertex graphs for which every t-spanner 
needs Q(n 1+1/{t+2) ) edges (see Page 179 in 

2. We then consider a version of SEQ-GREEDY in which the 
requirement that edges be considered in increasing or- 
der of length is relaxed. More precisely, the edges are 
distributed into 0(log n) bins Bo, B\, B2, . . . such that 
edges in Bi are all shorter than edges in Bi+\. It is 
then shown that any ordering of the edges in which 
edges in Bo come first, followed by edges in B\, fol- 
lowed by the edges in B2, etc., is good enough for the 
correctness of SEQ-GREEDY, even for d-dimensional a- 
UBGs. More importantly, we show that the update 
step in SEQ-GREEDY (Step 3(a)) need not be performed 
after each edge is queried. Instead, a more lazy up- 
date may be performed, after each bin is completely 
processed. Being able to perform a lazy update is crit- 
ical for a distributed implementation; roughly speak- 
ing, we want the nodes to query all edges in a bin in 
parallel and not to have to rely on answers to queries 
on other edges in a bin. 

3. We also use a clustering technique as a way to reduce 
the number of edges to be queried per node. Reducing 
the number of query edges per node, is critical to being 
able to guarantee that the output of our distributed 
version of SEQ-GREEDY does not have too many edges 
incident on a node. 

4. Our next contribution is to show that this relaxed 
version of SEQ-GREEDY can be implemented in a dis- 
tributed setting in O(logn) phases — one phase corre- 
sponding to each bin — such that each phase requires 
0(log* n) communication rounds. Each phase requires 
the computation of maximal independent sets (MIS) 
on some derived graphs. We show that the derived 
graphs are unit ball graphs of constant doubling dimen- 
sion [11] and use the 0(log* n)-round MIS algorithm 
of Kuhn et al [IT]. 

1.6 Extensions to our main result 

Here we briefly report on extensions to our main result 
that we have obtained. They do not appear in this paper 
due to lack of space. 

1. Let G = (V, E) be an edge-weighted graph. For any 
t > 1 and positive integer k, a k-vertex fault-tolerant 
t-spanner of G is a spanning subgraph G' if for each 
subset S of vertices of size at most k, G[V' \ S] is a t- 
spanner of G[V \S]. A fc-edge fault-tolerant t-spanner 
is defined in a similar manner. Using ideas from [2] 



we can extend our algorithm to produce a fc-vertex (or 
a fe-edge) fault-tolerant t-spanner in polylogarithmic 
number of communication rounds. 

2. In this paper, we use Euclidean distances as weights 
for the edges of the input graph G. However, if the 
metric c • |itw| 7 , for positive constant c and 7 > 1, is 
used in place of Euclidean distances \uv\, we can show 
that our algorithm still produces a spanner with all 
three desired properties. Relative Euclidean distances, 
such as the function mentioned above, may be used to 
produce energy spanners. 

3. Let G = (V, E) be an edge-weighted graph. The power 
cost of a vertex u £ V is power(u) = msix{w(u,v) 

v is a neighbor of u}. In other words, the power cost 
of a vertex u is proportional to the cost of u trans- 
mitting to a farthest neighbor. The power cost of G 
is X^ugv power(u) [8]. We can show that the output 
of our algorithm is not only lightweight with respect 
to the usual weight measure (sum of the weights of all 
edges) but also with respect to the power cost measure. 

2. SEQUENTIAL RELAXED GREEDY AL- 
GORITHM 

Now we show that a relaxed version of SEQ-GREEDY pro- 
duces an output G' with all three desired properties, even 
when the input is not a complete Euclidean graph, but is a 
d-dimensional, q-UBG for fixed d and a. Relaxing the re- 
quirement in SEQ-GREEDY that the edges be totally ordered 
by length and allowing for the output to be updated lazily 
are critical to obtaining a distributed algorithm that runs in 
polylogarithmic number of rounds. 

Let r > 1 be a constant to be fixed later and let Wi = 
r l a/n for each i = 0,1,2,.... Let Iq = (0, ct/n] and for 
each i = 1,2,... let h = (W l - 1 ,W t ]. Let m = Hog,- ^1 • 
Then, since no edge has length greater than 1, the length of 
any edge in E lies in one of the intervals 7o, Ii, ■ ■ ■ , I m - Let 
E, = {{u,v} G E : \uv\ € h}. 

We now eliminate the restriction that edges within a set 
Ei be processed in increasing order by length. We run 
SEQ-GREEDY in m + 1 phases: in phase i, the algorithm pro- 
cesses edges in Ei in arbitrary order and adds a subset of 
edges in Ei to the spanner. For < i < m, we use Gi 
to denote the spanning subgraph of G consisting of edges 
Eo U Ei U ■ ■ -UEi. Thus Gi is the portion of the input graph 
that the algorithm has processed in phase i and earlier. We 
use G'i to denote the output of the algorithm at the end of 
phase i. In other words, G'i is the spanning subgraph of G 
consisting of edges of G that the algorithm has decided to 
retain in phases 0,1, ... ,i. The final output of the algorithm 
is G = Gm ■ 

The way Eo is processed is different from the way Ei, i > 
is processed. We now separately describe these two parts. 

2.1 Processing Edges in e 

We start by stating a property of Go that follows easily 
from the fact that all edges in Go are small. 

Lemma 1. Every connected component of Go induces a 
clique in G. 

The algorithm PRQCESS-SHQRT-EDGES for processing edges in 
Eo consists of three steps (i) determine the connected com- 
ponents of Go, (ii) use SEQ-GREEDY to compute a t-spanner 



for each connected component (that is, a clique), and (iii) let 
G'o be the union of the t-spanners computed in Step (2) and 
output G' . The following theorem states the correctness 
of the PROCESS-SHORT-EDGES algorithm. Its proof follows 
easily from the correctness of SEQ-GREEDY. 

Theorem 2. G' satisfies the following properties, (i) 
For every edge {u, v} G Eq, G'o contains a uv-path of length 
at most t ■ \uv\, (ii) A(G' ) = O(l), and (iii) w(G' ) = 
0{w(MST(G))). 

2.2 Processing Long Edges 

We now describe how edges in Ei are processed, for i > 
0. The algorithm PROCESS-LONG-EDGES has five steps: (i) 
computing a cluster cover for G' i _ 1 , (ii) selecting query edges 
in Ei, (iii) computing a cluster graph Hi-i for (iv) 
answering shortest path queries for the query edges selected 
in Step (ii), and (v) removing redundant edges. These steps 
are described in the next five subsections. 

For any graph J, let V(J) denote the vertex set for J. 
For any pair of vertices u, v G V(J) let spj(u,i;) denote 
the length of a shortest Mu-path in J. Define a cluster of J 
with center u G V(J) and radius r to be a set of vertices 
C u C V(J) such that, for each v € C u , spj(u,v) < r. A 
set of clusters {C U1 , C U2 , . . .} of J is a cluster cover of J of 
radius r if every cluster in the set has radius r, every vertex 
in V(J) belongs to at least one cluster, and for any pair of 
cluster centers m and Uj, spj(ui,Uj) > r. 

2.2.1 Computing a Cluster Cover for G' i _ 1 

At the beginning of phase i we compute a cluster cover of 
radius 8Wi-\, where 8 < 1 is a constant that will be fixed 
later. We start with an arbitrary vertex u G V and run 
Dijkstra's shortest path algorithm with source u on C?^_i, 
in order to identify nodes v € V with the property that 
sp G , (u,v) < 8Wi-i; each such node v gets included in 

the cluster C u - Once C u has been identified, recurse on 
V \ C u until all nodes belong to some cluster and we have a 
cluster cover of G' i _ 1 of radius 5Wi-\. 

2.2.2 Selecting Query Edges in Ei 

As defined earlier, edges in Ei have weights in the interval 
Ii = (Wi-i, Wi], while the cluster cover for G' i _ 1 has radius 
5Wi-i, with S < 1. This implies that each edge in Ei has 
endpoints in different clusters. Our goal is to select a unique 
query edge per pair of clusters. This will guarantee that 
there are a constant number of query edges incident on any 
node (see Lemma 2} and this fact will be critically used by 
the distributed version of our algorithm to guarantee the 
degree bound on the spanner that is constructed. 

Let 9 be a quantity that satisfies < 9 < ? and t > 
l/(cos# — SU16 1 ). For any value t > 1, no matter how small, 
there always exists a 9 that satisfies these restrictions. De- 
fine an edge e = {u, v} G Ei to be a covered edge if there is 
a z G V such that (i) {u, z} G GJ-i, \vz\ < a and Zvuz < 9 
or (ii) {v,z} G \uz] < a and Zuvz < 9. Any edge 

in Ei that is not covered is a candidate query edge. The 
motivation for these definitions is the following geometric 
lemma, due to Czumaj and Zhao [2]. 

Lemma 3 (Czumaj and Zhao [2]). Let < 9 < f 
and t > — a 1 ■ a - Let u.v.z be three points in 1Z d with 

— cost/ — sm a ' ' L 

Zvuz < 9. Suppose further that \uz\ < \uv\. Then the edge 



{u, z} followed by a t-spanner path from z to v is a t-spanner 
path from u to v (see Figure]!^. 




Figure 1: (a) Edge {u,v} is covered: {u, z} followed 
by a f-spanner zv-p&th is a f-spanner itu-path. 

Now note that for each covered edge {u, v} G Ei, there 
exists z that satisfies the preconditions of Lemma [3] (by def- 
inition), and using this lemma we can show that G'i_i al- 
ready contains a uw-path of length at most t ■ \uv\. This 
suggests that covered edges need not be queried and there- 
fore we can start with the complement of the set of covered 
edges as candidate query edges. 

For each pair of clusters C a and Cb, let Ei[C a , C&] denote 
the subset of candidate query edges in Ei with one end- 
point in C a and the other endpoint in C a . Our algorithm 
selects a unique query edge {x, y} from each nonempty sub- 
set Ei[C a , Cb]- Assuming that x G C a and y G Cb, the edge 
{x, y} is selected so as to minimize 

t ■ \xy\ - Bp G ,_ i (o,a:) - epa'^JPiV) C 1 ) 

The quantity in {1} is carefully chosen to guarantee that, if a 
t-spanner path between the endpoints of an edge {a;, y} that 
minimizes {1} exists in G'i , then t-spanner paths between the 
endpoints of all edges in Ei[C a , Cb] exist in G'i (this property 
will later be shown in the proof of Theorem llO[l . This implies 
that, for each pair of clusters C a and Cb, querying the edge 
{x, y} in Ei[C a , Cb] that minimizes |l| renders querying any 
other edge in Ei[C a ,Cb] redundant. 

The following lemma shows that selecting query edges as 
described above filters all but a constant number of edges 
per cluster. The proof follows from two observations: (i) if a 
pair of cluster centers are connected by an edge in Ei, then 
the clusters are not too far from each other in Euclidean 
space (in particular, no farther than (45 + r)Wi-i) , and (ii) 
the Euclidean distance between any pair of cluster centers is 
bounded from below by SWi-i/t, because they would oth- 
erwise be part of the same cluster. 

Lemma 4. The number of query edges in Ei that are in- 
cident on any cluster is 0(t d ( 4S ^~ r ) d ), a constant. 

2.2.3 Computing a Cluster Graph 

For each selected query edge {x,y} G Ei, we need to 
know if G' i _ 1 contains an zy-path of length at most t ■ \xy\. 
In general, the number of hops in a shortest xy-path in 
can be quite large and having to traverse such a path 
would mean that the shortest path query corresponding to 
edge {x, y} could not be answered quickly enough. To get 
around this problem, we use an idea from [3] in which the 
authors construct an approximation to G' i _ 1 , called a cluster 
graph, and show that for any edge {x,y} G Ei, the short- 
est path query for {x, y} can be answered approximately 
on Hi-i in a constant number of steps. The goal of Das 
and Narasimhan [4] was to improve the running time of 



SEQ -GREEDY on complete Euclidean graphs, but we show that 
the Das-Narasimhan data structure can be constructed and 
maintained in a distributed fashion for efficiently answering 
shortest path queries for edges belonging to a a-UBG. In 
the following, we describe a sequential algorithm that starts 
with a cluster cover of G' i _ 1 of radius 5Wi-i, and builds a 
cluster graph of G' i _ 1 . This algorithm is identical to 

the one in Das and Narasimhan [4] and is included mainly 
for completeness. 

The vertex set of Hi-i is V and the edge set of Hi-\ 
contains two types of edges: intra-cluster edges and inter- 
cluster edges. An edge {a, x} is an intra-cluster edge if a is 
a cluster center and x is node in C a - Inter-cluster edges are 
between cluster centers. An edge {a, b} is an inter-cluster 
edge if a and b are cluster centers, and at least one of the 
following two conditions holds: (i) sp G , (a, 6) < Wi-i, or 

(ii) there is an edge in Gj_i with one endpoint in C a and 
the other endpoint in Ct,. See Figure [5] 




Figure 2: Edges interior to disks are intra-cluster 
edges. Edge {a, b} is an inter-cluster edge because 
sp G / (a,b) < Wi-i, and {b,c} is an inter-cluster edge 

because {u,v} is in G'i_i. An a;y-path in Gj_i, shown 
by the dashed curve may be approximated by the 
path x,a,b,y in Hi-i. 

Regardless of the type of a cluster edge e = {a, b} (inter- 
or intra-), the weight of e is the value of sp G / i (a,6). The 
following lemma follows easily from the definition of inter- 
cluster edges. 

Lemma 5. For any inter-cluster edge {a,b} in Hi-\, we 
have that sp G , (o, b) < (25 + 

The above upper bound also implies that \ab\ < (25 + 
l)Wi-\. Using this and arguments similar to those used 
for Lemma 31 we can show that the number of inter-cluster 
edges incident to a cluster center is 0((5 + l/5) d ), so we 
have the following lemma. 

Lemma 6. The number of inter- cluster edges in Hi~i in- 
cident to a cluster center is 0((5 + l/5) d ), a constant. 

The main reason for constructing the cluster graph 
is that lengths of paths in Hi-i are close to lengths of corre- 
sponding paths in G' i _ 1 and shortest path queries for edges 
in Ei can be answered quickly in Hi-\. The following lemma 
(whose proof appears in Das and Narasimhan 4.) shows that 
we can construct Hi-i such that path lengths in ap- 
proximate path lengths in G'i_i to any desired extent, de- 
pending on the choice of 5. 

Lemma 7. For any edge {x, y} £ Ei, if there is a path 
between x and y in G' i _ 1 of length L\, then there is a path 
between x and y in Hi-\ of length L2 such that L\ < L2 < 



2.2.4 Answering Shortest Path Queries 

For query edges {a;, y} £ Ei, we are interested in knowing 
whether G' i _ 1 has an xy-path of length at most t ■ \xy\. We 
ask this question on the cluster graph Hi-i. If Hi-i con- 
tains an zy-path of length at most t ■ \xy\, we do not add 
{x, y] to G\\ otherwise we do. If H%-\ contains an xy-path 
of length at most t ■ \xy\, then so does G' i _ 1 (by Lemma [7] 
since L\ < L2). Therefore, not adding {x, y} to the span- 
ner is not a dangerous choice. On the other hand, even if 
Hi-i does not contain an xy-p&th of length at most t ■ \xy\, 
G' i _ 1 might contain such a path and in this case adding 
edge {x,y} is unnecessary. Adding extra edges is of course 
not problematic for the t-spanner property. It will turn out 
that this is not a problem even for the requirement that 
the spanner should have bounded degree and small weight, 
given that paths in Hi-i can approximate paths in G' i _ 1 to 
an arbitrary degree. 

Given the structure of the cluster graph, all but at most 
2 edges in any simple ly-path are inter-cluster edges. Since 
the radius of each cluster is 5Wi-i, each inter-cluster edge 
has weight greater than 5Wi-i. We are looking for a path 
of length at most t ■ \xy\. Since \xy\ £ (Wj-i, W»], we are 
looking for a path of length at most t ■ Wi = t ■ r ■ Wi-i. 
Any simple path in Hi-i of length at most t ■ r ■ Wi-i has 
at most 2 + \tr/S\ hops, which is a constant. This yields 
the following lemma. 

Lemma 8. For any edge {x,y} £ Ei, if sp H . 1 (x,y) < 
t ■ \xy\, then Hi-\ contains a shortest xy-path with O(l) 
hops (no more than 2 + \tr/5~\). 

One issue we need to deal with, especially when attempt- 
ing to construct and answer queries in Hi-\ in a distributed 
setting, is that edges in Hi-i need not be present in the 
underlying network G. Specifically, for an intra-cluster edge 
{u, a}, where C a is a cluster and u € C a , it may be the case 
that \ua\ > a and {u, a} may be absent from G. Similarly, 
an inter-cluster edge {a, 6} in Hi-\ may be absent in G. 
However, for any edge {x,y} in Hi-i (intra- or inter-cluster 
edge) , we have the bound sp G / (x,y) < (25+l)Wi-i. This 

follows from Lemma [5] and the fact that the radius of each 
cluster is 5Wi-i. Thus a shortest ^y-path in G' i _ 1 lies en- 
tirely in a ball of radius (25 + l)Wi-i centered at x. Since 
G' i _ 1 is a spanning subgraph of G, this implies that there 
is a shortest sy-path P in G that lies entirely in the d- 
dimensional ball of radius (2<5 + l)Wi_i centered at x. Since 
any two vertices in P that are two hops away from each other 
are at least a apart (in the d-dimensional Euclidean space), 
P contains at most \2(25 + l)Wi-i/a] < [2(25+l)/af| hops. 
This argument yields the following theorem. 

Theorem 9. For any edge {x,y} £ Ei, ifsp H , 1 (x,y) < 
t ■ \xy\, then G contains a shortest xy-path with O(l) hops 
(no more than \2(25 + 1) /a]). 

This theorem implies that brute force search initiated 
from one of the endpoints, say x, will be able to answer 
the shortest path query on edge {x, y} in O(l) rounds in a 
distributed setting. 

2. 2. 5 Removing Redundant Edges 

Let ti be such that 1 < ti < t. Recall that shortest path 
queries for edges in Ei are answered on and so updates 



to G'i in phase i do not influence subsequent shortest path 
queries in phase i. Thus it is possible that in phase i two 
edges {u,v} and {u',v'} get added to Gi, yet both of the 
following hold: 

(i) sp H ._ i (w, u') + \u'v'\ + sp Hi i (v' ,v) < ti ■ \uv\ 

(ii) sp H (vf, u) + \uv\ + spjy. («, «) < t\ ■ \u'v'\ 

Note that, since sp G / (x,y) < spjj. t (x,y) holds for any 

pair of nodes x and y, and since ti < t, conditions (i) and 
(ii) above imply that G'i contains t-spanner paths from u to 
v and from u to v' . We call two edges {u,v} and {«,«'} 
satisfying conditions (i) and (ii) above mutually redundant: 
one of them could potentially be eliminated from Gi, with- 
out compromising the t-spanner property of Gi. In fact, 
such mutually redundant pairs of edges need to be elimi- 
nated from G'i because our proof that G' has small weight 
(Theorem [13]) depends on the absence of such pairs of edges. 

To do this, we build a graph J that has a node for each 
edge in a mutually redundant pair and an edge between ev- 
ery pair of nodes that correspond to a mutually redundant 
pair of edges in G'i. We construct an MIS I of J and elim- 
inate from G'i all edges associated with nodes in J that do 
not appear in I. 

2.3 The Three Desired Properties 

Let G = G' m be the spanner at the end of phase m. We 
now prove that G' satisfies the three properties that the 
output of SEQ-GREEDY was guaranteed to have. The proofs 
of these theorems form the technical core of the paper and 
are presented next in this section. 

Theorem 10. For any < 5 < —t^, the output G' is a 
t-spanner. 

Proof. We first prove that the theorem holds for all 
query edges in E, then we extend the argument to non-query 
edges as well. Let {x, y} be an arbitrary query edge and let 
i > 1 be such that {x,y} £ Ei. Then either (i) {x,y} is 
added to the spanner in phase i, or (ii) sp H . i (x, y) < t-\xy\. 
If the former is true and {x, y} is not a redundant edge, then 
the theorem holds. If {x, y} is a redundant edge but does 
not get removed from Gi, then again the theorem holds. If 
{x,y} is a redundant edge that gets removed from Gi, then 
at least one mutually redundant counterpart edge must re- 
main in Gi (since removed edges form an independent set), 
ensuring a t-spanner :ri/-path in Gi. If (ii) is true, then from 
Lemma]?] sp G / (x,y) < sp^. ± (x,y) (first part of the in- 
equality) and therefore sp G / (x,y) < t ■ \xy\. 

For non-query edges, the proof is by induction on the 
length of edges in G. The base case corresponds to edges in 
Eo, for which SEQ-GREEDY ensures that the theorem holds. 

Assume that the theorem is true for any edge in E of 
length no greater than some value q, and consider a smallest 
non-query edge {x, y} in G of length greater than q. We 
prove that sp G ,(x,y) < t ■ \xy\. Let i be such that {x,y} £ 
Ei. We now consider two cases, depending on whether {x, y} 
is a candidate query edge in phase i or not. 

If {a;, y} is not a candidate query edge, then it is a covered 
edge. That is, there exists an edge {x, z) in G' i _ 1 such that 
\yz\ < a and Zyxz < 9, or an edge {y, z} in G' i _ 1 such 
that \xz\ < a and Zxyz < 9. The two cases are symmetric 
and so without loss of generality, assume that the former 



is true. Here 9 satisfies the hypothesis of the Czumaj-Zhao 
lemma (Lemma [3}, that is, < 9 < J and t > cos gl sin0 ■ 
Since \yz\ < a and G is an a-UBG, this implies that {y, z} 
is an edge is E. Furthermore, since < 9 < j, we have 
\yz\ < \xy\. Refer to Figure^. If {y, z} is a query edge, 
then by the argument above we have that G' contains a 
t-spanner yz-path p. Otherwise, if {y, z} is not a query 
edge, since its length is less than the length of {x,y}, by 
the inductive hypothesis we get that there is a t-spanner yz- 
path p. In either case, Lemma|3]tells us that {x, z} followed 
by p is a t-spanner path from x to y, completing this case. 




(a) (b) 

Figure 3: (a) {x, y} is a covered edge (b) {u, v} is a 
query edge: if Gi contains a t-spanner mi-path, then 
Gi contains a t-spanner sy-path. 

We now consider the case when {x, y} is a candidate query 
edge in phase i, but not a query edge. Let a and b be such 
that x € C a and y £ Ct, and let {u, v} be the query edge 
selected in phase i, with u £ C a and v £ Cb- Refer to 
Figure |3p. Due to the criteria for selecting {u,v}, we have 

t • \uv\ — sp G , (a,u) — sp G / (b, v) < 

t ■ \xy\ - spg/^ {a, x) - sp^ ^ (b, y). (2) 

Recall that G'i is the partial spanner at the end of phase 
i. We show that sp G ,(x,y) < t ■ \xy\. We discuss two cases, 
depending on whether {u, v} was added to G'i or not. 

Assume first that {u, v} was not added to G'i. This means 
that sp^.^ (it, v) < t ■ \uv\. Note however that 

s VB i . 1 (. U ' V ) = s VG' 1 ^ 1 ( u ' a ) + s PH,_ 1 ( a -: b ) + s PG' i _ 1 ( b ' V ) 

< t-\uv\. (3) 
We now evaluate 

sPg/^ < sp G ,_ i (a;,a)-|-sp G ,_ i (a J 6)-|-sp G ,_ i (6,y) 

< s Pc t _ 1 ( x > a ) + s Ph z _ 1 (a, b) + spg/^ (6, y) 

< t-\xy\. 

This latter inequality involves simple substitutions that use 
inequalities ([2]) and (J3J, and completes this case. 

Now assume that {u, v} was added to G'i. Since u £ C a and 
C a has radius 5Wi-i, we have that sp G / (a, u) < SWi-i. 

Similarly, sp G / (6, v) < 8Wi-\. These together with Q 

yield 

t • \uv\ — 28Wi-\ < t ■ \xy\ — sp G / (a,x) — sp G , (b,y). (4) 

If the edge {it, v} turns out to be redundant and eliminated 
from Gi, the existence of a mutually redundant counterpart 
edge in G'i ensures that sp G ; (u,v) < ti • \uv\. This enables 

us to construct in G'i a path from a to b of weight 



sp G i(a,b) < sp G / (o, u) + ti ■ \uv\ + sp G / (y, b) 

< 28Wi-i+ti-\uv\, (5) 

since sp G / (a, u) < sp G , (a,u) < <5VKi_i, and same for 
sp G / (v, b). We can now construct a path in G' t from x to y 
of weight 

sPci^) ^ sPg;^'^) + s Pg;(M) + s PG'i a ' b ) 

< t ■ \xy\ + 28W l -i- t ■ \uv\ + sp G , (a, b) 

< t-\xy\+45Wi-i-{t-ti)-\uv\ 

< t-\xy\+A5Wi-x-{t-tx)Wi-i 

In deriving this chain of inequalities, we have used (j4|, ((5J 
and the fact that \uv\ > Wi-i. Note that for any S < 
the quantity A8Wi-i — (t — 1) • Wi-i above is negative, 
yielding sp G (x,y) < t ■ \xy\. This completes the proof. □ 




Figure 4: (a) Region \ contains two neighbors v and 
z of u. (b) Definition of the t-leapfrog property with 

S = {{ui,Ul}, {li 2 ,W2}, {u 3 ,V 3 }}. 



Theorem 11. G' has O(l) degree. 

Proof. Let 6 be a quantity satisfying the conditions of 
Lemma [3] Fix a vertex u and consider the ei-dimensional 
unit radius ball centered at u. For some T that depends 
only on 8 and d, this ball can be partitioned into T cones, 
each with apex u, such that for any x, y in a cone, Zxuy < 
Yao [20] shows how to construct such a partition with 
T = 0{d 3/2 ■ s\rT d {e/2) ■ log(dsin _1 (6l/2))) cones. Place 
an infinite axis-parallel grid of d-dimensional cubes, each 



of dimension x x 



x on the plane. See Fig- 



ure |4fc) for a 2-dimensional version of this picture. There 
are 0(l/a d ) cells that intersect the unit ball centered at 
it, and therefore there are 0(l/a d ) cells that intersect each 
cone in the cone partition of this unit ball. Thus the cones 
and the square cells together partition the unit ball cen- 
tered at it into 0(T/a d ) regions. We show that in G' , u has 

0( f ) neighbors in each region, which is a constant. 

Let vi,V2, ■ ■ ■ ,Vk be neighbors of u in G' that lie in a 
region \- Without loss of generality, assume that \uv\\ > 
\uvj\, for j — 2,...,k, and let i be such that {u,vi} G 
Ei. Since \uvj\ < \uvi\, we have that for all j — 2, . . . , k, 
{u,Vj} £ E e , with I < i. 

We now prove that {u,Vj} is in fact in Ei for all j. To 
derive a contradiction, assume that there is a j > 1 such 
that {u, Vj} £ Et, with £ < i. This means that just before 
edge {u, vi} is processed, G' contains edge {u,Vj}. Also 
note that since vi and Vj lie in the same region, \viVj \ < a. 
But, this means that {u, vi} is a covered edge in phase i and 
will not be queried. This contradicts the presence of edge 
{u, vi} in G' . 



We have shown that {«, Vj} £ Ei for all j. Recall that 
our algorithm picks a unique query edge per pair of clusters. 
This along with Lemma [3] proves that k is constant. □ 

In the next theorem, we show that the spanner produced by 
the algorithm has small weight. The proof relies on the line 
segments in the spanner satisfying a property known as the 
leapfrog property [2] [S]. For any t > > 1, a set of line 
segments, denoted F, has the (t2,t) -leapfrog property if for 
every subset S = {{ui, ui}, {u2,V2}, . . . , {u s ,v s }} of F 



t2 ■ \uiVl\ < \uiVi\ +t ■ ( 2^ \ViUi+l\ + \v s ui\ 



(6) 



Informally, this definition says that if there exists an edge 
between ui and v± , then any path not including {iti , vi } 
must have length greater than t2|ui^i| (see Figure |4jc) for 
an illustration of this definition). The following implica- 
tion of the (t2, t)-leapfrog property was shown by Das and 
Narasimhan [J. 

Lemma 12. Let t > t2 > 1- If the line segments F in 
d- dimensional space satisfy the (t2,t) -leapfrog property, then 
wt(F) = 0(wt(MST)), where MST is a minimum spanning 
tree connecting the endpoints of line segments in F. The 
constant in the asymptotic notation depends on t, t2 and d. 

Theorem 13. Let < S < mm{(t - l)/(6 + 2t),{t - 
ti)/4}. Let t s denote U ■ (1 - 2<5)/(l + 65). Let 1 < r < 
(t$ + l)/2. When the relaxed greedy algorithm is run with 
these values of 5 and r, the output G' satisfies w(G') — 
0{wt{MST{G))). 

Proof. Let /3 > 1 be a constant picked as follows. When 
ta < 1, pick P satisfying 1 < j3 < min{2, 1/(1 - to)}. Oth- 
erwise, pick /3 satisfying 1 < /3 < 2. Partition the edges 
of G' into subsets Fq,Fi, . . . such that Fo = {{u, v} g G' \ 
\uv\ < a} and for each j > 0, Fj = {{u, v} G G' j q/3 j_1 < 
\uv\ < a/?-'}. Let £ — [log^ — ]. Then every edge in G' is in 
some subset Fj, < j < £. We will now show that each Fj 
satisfies the (t2, i)-leapfrog property, for any t2 satisfying: 

1 < ; 2 < min{ !i±i _ 1,2* 2 Q+ l 
r r r p p 

It is easy to check that our choice for 5, r, and [3 guarantee 
that each quantity inside the min operator is strictly greater 
than 1. Showing the (t2, t)-leapfrog property for Fj would 
imply that w(Fj) = 0{w(MST(G))), and since the edges 
of G' are partitioned into a constant number of subsets Fj , 
w(G') = 0(w(MST{G))). 

Consider an arbitrary subset S = {{ui,vi}, {112,^2}, . . . , 
{u s , u,,}} C Fo . To prove inequality for 5*, it suffices to 
consider the case when {ui,vi} is a longest edge in S. We 
consider Fq separately from Fj, j > 0. 

The Fo case. If for any 1 < k < s, |iifcMfc+i| > |ttit>i| 
or |i! 3 ui| > |ui«i|, then the leapfrog property holds. So 
we assume that for all 1 < k < s, \vkUi t+1 \ < \mvi\ and 
I u s ui I < |ttivi|. Let i be the phase in which {ui,v\} gets 
processed, i.e., G Ei. Since |tiiiii| < a, it is the case 

that for all 1 < k < s, \vkUk+i\ < a and |DsUi| < a. Hence, 
{{v s ,ui}} U {{vk,Uk+i} I 1 < k < s} is a subset of edges 
of G and each edge in this set gets processed in phase i or 
earlier. 

Assume first that at least one edge in the set {{v s , Mi}} U 
{{vk,n,k+i} 1 < k < s} gets processed in phase i. Then 



the right hand side of inequality © is at least tWi-i, since 
edges in Ei have weights in the interval Ii = rWi-i]. 
Also since i2|wi^i| < t^rWi-x, and since the inequality 
t2rWi-i < tWi-i is guaranteed by the values of r and t? 
in ([7]), the leapfrog property holds for this case. 

Assume now that all edges in {{v s ,ui}} U {{vk,Uk+i} \ 
1 < k < s} have been processed in phase i — 1 or earlier, 
meaning that t-spanner paths between their endpoints exist 
in G'i_i at the time {ui,vi} gets processed. For 1 < k < s, 
let Pk be a shortest w fe u fc+ i-path in G' i _ 1 , and let P s be a 
shortest u s ui-path in G' i _ 1 . Let P be the following uivi- 
path in G-: P = Pi © {u 2 ,w 2 } © P 2 © {^3,1*3} ® Pa- 

Here, we use © to denote concatenation. We distinguish 
three cases, depending on the size of the subset S PI Ei. 

(i) \SC\Ei\ > 2. Then, w(P) > 2Wi-i. We also have that 
|mi«i| < rWi-i, since {ui,vi} £ Ei. It follows that 
w(P) > t2|wi^i| for any t 2 < §■ Furthermore, w(P) 
is no greater than the right hand side of the (t 2 ,£)- 
leapfrog inequality ([SJ , so lemma holds for this case as 
well. 

(ii) \S n Ei\ — 2. In addition to {ui,vi}, assume that 
{uk,Vk} 6 Ei for some k, 1 < k < s. It the (i 2 ,£)- 
leapfrog inequality (|6]) holds, we are done and so let 
us assume the opposite of that: 

s s — 1 

i 2 • \uivi\ > 2~2 \uiVi + 1 ■ ^2 I + \v s ui\ j . (8) 

i = 2 i = l 

Since all edges {uj, Vj}, 1 < j < s, except for {ui,vi} 
and {iik,Vk} are in G' i _ 1 , and since G' i ^ 1 contains t- 
spanner UjUj+i-paths for all j, 1 < j < s, and a t- 
spannner « s iii-path, the above inequality yields 

t2 ■ \uiVi\ > Sp Q / (wijUfc) -+- |«fc«fc| + Sp G / 

Multiplying both sides by (1 + 65) /(l — 25) and using 
t 2 < ts (which is implied by our choice of t 2 ) and 
Lemma [7] we get 

ti ■ \uiVi\ > sp H ,_ i (vi,Uk) + \u k Vk\ + sp H 

(9) 

Let A = YltZi \ + |i>8Wi|. We now observe that 

t s -\u k vk\<^2\uiVi\+ \um\+t-A (10) 

i=l i = fc+l 

implies the (t 2 , t)-leapfrog property. To see this use the 
fact that both {ui,vi} and {uk,Vk} belong to Ei and 
therefore luiuij < r ■ \v,kVk\, which substituted in (|10|l 
yields: 

s 

ts ■ \u k v k \ - (r — 1) ■ \u k v k \ < 22 \ U i V i\ +* • A. 

i = 2 

We get the lower bound f 2 • |ui«i| on the left hand 
side of the above inequality by using |itfci>fc| > |wiwi|/r 
again and our choice of ti < (ts + l)/r — 1. This 
yields the (t 2 , i)-leapfrog property. So we assume that 
inequality (j 1 Of) does not hold, that is, 

fc-l s 
tg ■ \UkVk\ > / ] \UjVj\ + 22 +* ' A- 

i=l i=fc+l 



Since all edges {iij, Vj}, 1 < j < s, except for {ui,v\} 
and {uk,Vk} are in Gi_i, and since G£_i contains t- 
spanner Wjiij+i-paths for all j, 1 < j < s, and a t- 
spannner « s iti-path, the above inequality yields 

ts ■ \uicVk\ > sp G ,_ («i,Ufc) + \uivi\ + sp G /_ (v k ,ui). 

Multiplying both sides by (1 + 65) /(l — 28) and using 
Lemma [7] we get 

ti • > Bp H ._ 1 (ui,ttfc) + [mivi] + sp Hi _ i (« fc ,ui). 

(11) 

Inequalities © and imply that edges {«i , v\ } and 
{u 2 ,u 2 } are mutually redundant and therefore cannot 
both exist in the spanner — a contradiction. 

(in) \S Pi Ei I = 1. This means that P exists in G' i _ 1 at 
the time {iti,i;i} is processed. Furthermore, w(P) > 
t-\uivi\ > t 2 -|itiui|, otherwise {ui,v\} would not have 
been added to the spanner, a contradiction. 

The Fj case, j > 0. In this case, |itfcffc | > \u\V\\/ f3 for 
all k — 2,3, ...,s. If ISj > 3, then the right hand side 
of the (t2, i)-leapfrog inequality ((6| is at least 2 ■ \uiv\\/(i 
and therefore the (t 2 , f)-leapfrog inequality goes through for 
any 1 < t 2 < 2//3. Otherwise, if \S\ = 2, then we need to 
show that t2 ■ \u\V\\ < |it 2 i; 2 | + t ■ (|mid 2 | + |it2fi|). If each 
of |miV2| and |M2Wi| is at most a, then using the same ar- 
gument as in the Po-case with \S fi Ei\ — 2, we can show 
that {ui,i;i} and {112,^2} are mutually redundant and will 
not both exist in the spanner. Otherwise, if one of |uiti2| 
or |u2fi| is greater than a, then the right hand side of the 
(t2, £)-leapfrog inequality (]§} is greater than |ui«i|//3 + ta. 
To ensure that the inequality goes through, we require that 
t 2 -K«i| < ] ^ A +ta. Since | M1W1I < 1, the above inequal- 
ity is satisfied for any 1 < ti < ta + 4 , which holds true 
cf. (JjJ. □ 

3. DISTRIBUTED RELAXED GREEDY AL- 
GORITHM 

We now describe a distributed version of the relaxed greedy 
algorithm from Section[2] Like the sequential relaxed greedy 
algorithm, this algorithm also runs in O(logn) phases — 
with edges in Ei being processed in phase i. We will show 
that edges in Eo can be processed in O(l) rounds. Re- 
call that each subsequent phase consists of the following 
five steps: (i) computing a cluster cover of G;_!, (ii) se- 
lecting query edges in Ei, (iii) computing a cluster graph 
Hi-i of Gi_i, (iv) answering shortest path queries for se- 
lected query edges, and (v) deleting some redundant edges. 
We will show that Steps (ii), (iii), and (iv) can be completed 
in O(l) rounds and Steps (i) and (v) take 0(log* n) rounds. 
Step (i) and Step (v) will each involve computing an MIS in 
a certain derived graph and in both cases, we will show that 
the derived graph is a UBG that resides in a metric space 
of constant doubling dimension. Putting this all together, 
we will show that the algorithm runs in 0(log n ■ log* n) 
communication rounds. 

3.1 Distributed Processing of Short Edges 

Lemma Q] implies that vertices in the same component of 
Go = G[Po] induce a clique and therefore can communicate 
in one hop with each other. In the distributed version of the 
algorithm, each vertex u obtains the topology of its closed 



neighborhood along with pairwise distances between neigh- 
bors in one hop. Using this information, u determines the 
connected component C of Go that it belongs to. Then u 
simply runs SEQ-GREEDY on C and computes a i-spanner of 
C. Finally, u identifies the edges of the t-spanner incident 
on itself and informs all its neighbors of this. 

Theorem 14. The edges in Eo can be processed in O(l) 
rounds of communication. 

3.2 Distributed Processing of Long Edges 

In this section, we show how long edges, that is, edges in 
Ei, i > 0, can be processed in a distributed setting. The first 
step of this process is the computation of a cluster cover for 
the spanner G' i _ 1 updated at the end of the previous phase. 

3.2.1 Distributed Cluster Cover for G' i _ 1 

Recall that in this step our goal is to compute a clus- 
ter cover {C U1 ,C U2 , ■ ■ ■} of G' i _ 1 of radius 5Wi-i. To do 
this, each node u first identifies all nodes v in G satis- 
fying sp G , {u,v) < SWi-i. Using arguments similar to 
those in Section [2]2]4j we can show that any node v satisfy- 
ing sp G / {u,v) < 8Wi-i must be at most 2SWi~i/a hops 

from u. So each node u constructs the subgraph of G' i _ 1 
induced by nodes that are at most 25Wi-i/a hops away 
from it in G. Node u then runs a (sequential) single source 
shortest path algorithm with source u on the local view of 
G'i_i it has obtained and identifies all nodes v satisfying 

Sp G , (u, V) < SWi-l. 

At the end of the above process, every node u in the net- 
work is a cluster center. We now force some nodes to cease 
being cluster centers, so that all pairs of cluster centers are 
far enough from each other. Let J be the graph with vertex 
set V and whose edges {x,y} are such that x G C y (and by 
symmetry, y G C x ). 

Lemma 15. J is a UBG that resides in a metric space of 
constant doubling dimension. 

Proof. For any edge {x,y} in J, we have that x G C y 
and therefore sp G / (x,y) < 6Wi-i. Assign to every pair 

of nodes {x,y} in V a weight w(x,y) — sp G , {x,y). The 
weights w form a metric simply because shortest path dis- 
tances in any graph form a metric. Thus J is a graph whose 
nodes reside in a metric space and whose edges connect pairs 
of nodes separated by distance of at most SWi-i (in the met- 
ric space). By scaling the quantity SWi-i up to one, we see 
that J is a UBG in the underlying metric space defined by 
the weights w. Recall from [llj that the doubling dimen- 
sion of a metric space is the smallest p such that every ball 
can be covered by at most 2 P balls of half the radius. To see 
that the metric space induced by the weights w has constant 
doubling dimension, start with a ball of B radius R centered 
at an arbitrary vertex u. Every vertex v in ball B satisfies 
sp j (u, v) < R. Now cover the vertices in B using balls of 
radius R/2 as follows: repeatedly pick an uncovered vertex 
v in B and grow a ball of radius R/2 centered at v, until all 
vertices have been covered. We now show that the number 
of balls of radius R/2 is constant. 

Let a and b be two arbitrary centers of different balls of 
radius R/2. Then spj(u,v) > R/2, otherwise a and 6 would 
belong to the same ball of radius 72/2. We distinguish three 
situations: 



• {a,b} is not an edge in G. This implies that \ab\ > a. 

• {a, b} is an edge in G that has not been processed prior 
to phase i. This implies that \ab\ > Wi-i > 1/8 (after 
scaling 8Wi-i up to 1). 

• {a, b} is an edge in G that has been processed prior to 
phase i. This implies that G\_x contains a i-spanner 
path from a to & and therefore \ab\ > sp G , (a, b)/t > 

R/2t. 

We have established that \ab\ > min{«, i, ||}, so no two ball 
centers can be too close to each other. It follows that the 
number of balls of radius R/2 that fit inside B is constant, 
proving the lemma true. □ 

Let I be an MIS of J constructed using the MIS algorithm 
in pT]. This algorithm runs in 0(log* n) communication 
rounds on a UBG that resides in a metric space of constant 
doubling dimension. Then each node in V\I has one or more 
neighbors in /. Each node u G / is declared a cluster center, 
and each node v G V \ I attaches itself to the neighbor in I 
with the highest identifier. This gives us the desired cluster 
cover of radius 8Wi-i. 

Theorem 16. A cluster cover of G'i_i of radius SWi-i 
can be computed in 0(log*n) rounds of communication. 

3.2.2 Distributed Query Edge Selection 

Only nodes that are cluster heads need to participate in 
the process of selecting query edges. Each cluster head a 
seeks to gather information on all edges in Ei between the 
cluster C a and any other cluster Cb- Using the argument 
in Section [2.2.41 we know that every node in C a is at most 
2SWi-i/a hops away from a in G. Therefore, if there is 
an edge {u, v} G Ei, u G C a and v G Cb, then v is at most 
l + 2SWi-i/a hops away from a. So a gets information from 
nodes that are at most 1 + 28Wi-\/a hops away from it and 
it identifies all edges in Ei [C a , Cb] ■ Recall that this is the 
set of edges in Ei which connect a node in C a and a node in 
Cb- Node a then discards all covered edges from Ei[C a , Cb], 
leaving only candidate query edges in Ei between C a and 
Cb- Finally, from among the candidate query edges, node a 
selects an edge {«, v} that minimizes t ■ \uv] — sp G / (a, u) — 

Theorem 17. Query edges from Ei can be selected in 
O(l) rounds of communication. 

3.2.3 Distributed Construction of the Cluster Graph 

As in the query edge selection step, only the cluster heads 
need to perform actions to compute the cluster graph. Any 
member u of a cluster C a lies at most 28Wi-i/a hops away 
from a in G. Thus a can identify intra-cluster edges inci- 
dent on it by gathering information from at most 28Wi~\ / a 
hops away. If Cb is a cluster with sp G / (a, b) < W»_i, then 

node a can identify the inter-cluster edge {a, b} by gather- 
ing information from at most 2Wi-\/a hops away. If Cb 
is a cluster such that there is an edge {u, v} in G' i _ 1 with 
u G C a and u £ d, then node a can identify the inter- 
cluster edge {o, b} by gathering information from at most 
2(25+ l)Wi-i/a hops away. Note that the information that 



a gathers contains a local view of G;_x along with all pair- 
wise distances. Using this information, node a is able to run 
a single source shortest path algorithm with source a and 
determine the weights of all inter-cluster and intra-cluster 
edges incident on a. 

Theorem 18. Computing the cluster graph Hi-i of Gj_i 
takes O(l) communication rounds. 

3.2.4 Answering Shortest Path Queries 

Each node u knows all the query edges incident on it. 
As proved in Section 12.2.41 node it only needs to gather 
information from nodes that are at most a constant number 
of hops away, to be able to determine locally, for all incident 
query edges {u, v} G Ei, whether sp H ._ (u,v) < t ■ \uv\. 
Thus, after constant number of communication rounds, it 
knows the subset of incident query edges {u, v} for which 
sp H (u, v) > t ■ \uv\ and it identifies these as the incident 

edges to be added to G;. 

Theorem 19. Answering shortest path queries takes O(l) 
communication rounds. 

3.2.5 Distributed Removal of Redundant Edges 
Two edges {u,v} and {u',v'} in G\ are mutually redun- 
dant if (i) sp w . (it, u') + \u'v'\ + spjj. (t/, v) < ti ■ \uv\ and 
(ii) sp H . (u', w)+|ww|+sp JJ -. (v, «') < ti-|u'u'|. Each node 
it takes charge of all edges {u, v} added to Gi in phase i and 
for which the identifier of it is higher than the identifier of 
v. For each such edge {it, v} that it is in charge of, u deter- 
mines all edges {it',i/} such that {u,v} and {lt',i/} form a 
mutually redundant pair. Note that the nodes it and v are 
a constant number of hops away from each other in G, and 
similarly for nodes v and it'. Node it then contributes to 
the construction of the graph J by adding to V(J) a vertex 
for each redundant edge it is in charge of, and to E(J) an 
edge connecting nodes in V(J) that correspond to mutually 
redundant edges in Gi. We now show the following property 
of J: 

Lemma 20. J is a UBG that resides in a metric space of 
constant doubling dimension. 

Proof. Let a and b be vertices in J corresponding to 
edges {u a ,v a } and {u b ,v b } in G;. Assign to the vertex pair 
(a, b) a weight equal to 

dj(a,b) = min(sp Jf ._ i (u a , u b ) + sp H ._ i (v a , v b ), 

First we show that the weights defined by dj form a met- 
ric. Clearly dj(a,a) — and dj(a,b) = dj(b,a). To prove 
the triangle inequality, consider three vertices a,b,c G J. 
Assume w.l.o.g. that 

dj(a,b) = sp Hji (it a , it b ) + sp H ._ i (v a ,v b ) 
dj(b,c) = sp Hj i (ub, u c ) +sp H ._ x (v b ,V c ) 

We identify two possible scenarios: 

(1) dj(a,c) = sp Hii (u a ,u c ) + sp H ._ i (v a , v c ) (see Fig- 
ure [Sp). Since sp is itself a metric, it follows immedi- 
ately that dj(a, c) < dj(a, b) + dj(b, c). 




c u c v c u c v c 

(a) (b) (c) 

Figure 5: d.j is a metric: (a) Nodes a,b,c G J cor- 
respond to edges {u a ,u b }, {ub,u c }, {u a ,u c } G G[. (b) 
dj(a,c) = sp Hi i (u a ,u c ) + sp Hi i (v a ,v c ) (c) dj(a,c) = 

(2) dj(a,c) = sp i?i _ l (u a , v c ) + sp Hii (v a ,u c ) (see Fig- 
ure[5j:). Then it must be that dj (a, c) < sp H . (u a ,u c ) 
+ sp Hi _ i (v a ,v c ) < dj(a,b) + dj(b,c), cf. scenario (1). 

We have shown that dj defines a metric. We now show 
that J is a quasi-UBG residing in the metric space defined 
by dj. For each edge {a, b} in J, the following redundancy 
conditions hold: 

(a) sp H ._ i (u a ,Ub) + sp Hi i (v b ,v a ) < h ■ \u a v a \ - \u b v b \ 

(b) sp Hi i (u b ,u a ) + sp Hi i (v a ,v b ) < ti ■ \u b v b \ - \u a v a \ 

Recall that {u a ,v a } and {1*6,1;;,} are both in Ei, for some 
i > 0. This implies that their lengths differ by a factor 
of r at the most: Wi-i < \u a v a \ < r ■ Wi-i and Wi-\ < 
\u b v b \ < r ■ Wi-i. Thus the right hand side of inequalities 
(a) and (b) above is a quantity that lies in the interval ((ti — 
r)Wi-i, (tir — l)Wi_i). By scaling (tir — l)Wi_i up to one 
we can say that J is an * X ~S 1 - qUBG in the underlying 
metric space defined by d.j. 

It remains to show that the metric space defined by dj 
has constant doubling dimension. Throughout the rest of 
the proof we use Bj (Bh) to denote a ball in the metric 
space defined by dj (sp H ._ ). 




(a) (b) 



Figure 6: (a) The metric space defined by (a) sp H 
and (b) dj, has doubling dimension. 

Consider a ball Bj(x,R) of radius R centered at an arbi- 
trary vertex x G J corresponding to edge {u x ,v x } G Hi-\. 
Let a, b G J be such that dj(a,b) > R/2 (see Figure [6}. As- 
sume w.l.o.g that dj(a, b) = sp Hii (u a , u b ) + sp Hil (v a ,v b ). 
Then at least one of the following must be true: 

(i) sp Hi i (u a ,Ub) > R/A 

(ii) sp Ht _ 1 (u„,w () ) > R/i. 



We use these observations, along with the fact that sp ff . 
defines a metric space of constant doubling dimension, to 
show that dj defines a metric space of constant doubling 
dimension. 

To cover all vertices in Bj(x,R), do the following re- 
peatedly: (i) pick an uncovered vertex a £ J (ii) grow a 
ball Bj(a, R/2), and (iii) grow two balls Bn{u a , -R/4) and 
B H (v a , -R/4) in H^, where a = {u a ,v a }- 

Arguments similar to the ones used in Lemma fTS] show 
that, for any ball centers u a ,Ub G Bh{u x ,R), we have that 
|watti>| > min{a, i, ||}. Since no two ball centers can be too 
close to each other, it follows that Bh(u x , R) gets covered 
by a constant number of balls of radius -R/4, and similarly 
for Bh(v x , R)- Conform observation (ii) above, correspond- 
ing to each uncovered a G J, there is an uncovered vertex 
u a or v a in These together show that the number 

of balls covering Bj(x,R) is constant, thus completing the 
proof. □ 

Let I be an MIS of J constructed using the MIS algorithm in 
[11] that takes 0(log* n) communication rounds on a UBG 
that resides in a metric space of constant doubling dimen- 
sion. Each node u then removes from Gi all incident edges 
in V(J)\I. 

Theorem 21. Removing redundant edges takes 0(log* n) 
communication rounds. 

4. FUTURE WORK 

The results presented in this paper apply to a-UDGs em- 
bedded in constant-dimension Euclidean spaces, and do not 
directly generalize to doubling metric spaces. For low di- 
mensional doubling metric spaces, we believe it possible to 
construct an 0(lognlog* n) distributed algorithm that pro- 
duces a (1 + e)-spanner with constant maximum degree. 
However, new techniques may be needed for lightweight 
spanners; the techniques presented in this paper use a key 
property (the leapfrog property) that does not seem to gen- 
eralize to metrics of doubling dimension. 
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